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CLAIMS 

Having thus described my invention, what I claim as new and desire to secure by Letters 
Patent is as follows: 



1 LA method for extracting and transforming content from a source page for 

2 transmission to a mobile device, said method comprising the steps of: 

3 (1) generating a stylesheet, wherein said stylesheet includes 

4 information indicating the content to be extracted from said source page and 

5 transformation information for manipulating the content; 

h6 (2) receiving, from the mobile device, a request to display said source 

SJ7 page; 

018 (3) applying said stylesheet to said source page to produce a 

9 destination page, wherein said destination page includes said extracted content 

110 manipulated according to said transformation information; and 
,il (4) transmitting said destination page to said mobile device. 

: ^1 2. The method of claim 1, wherein step 3 comprises the steps of: 

2 (1) retrieving said source page from a web server; and 

3 (2) identifying said content to be extracted using a site mining 

4 expression. 

1 3. The method of claim 1, further comprising the step of determining a site 

2 mining expression for uniquely locating said content to be extracted. 

1 4. The method of claim 1, wherein step 1 comprises the steps of: 
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2 (1) receiving and storing to a site mining template said information 

3 indicating said content to be extracted and said transformation information for 

4 manipulating the content; and 

5 (2) compiling said template to produce said stylesheet. 

1 5. The method of claim 1, wherein said source page comprises a XML 

2 compliant document. 

1 6. The method of claim 1, wherein said source page comprises a HTML 

.2 document. 

IjlJI 7. A method for generating a stylesheet, said method comprising the steps of: 
H2 (1) receiving an indication of an item of content to be extracted from a 

"i3 source page containing one or more items of content; 
r 4 (2) determining an expression for uniquely locating said item of 

:Jp content to be extracted; 

OS (3) receiving transformation information for manipulating said item of 

7 content; 

8 (4) storing said transformation information and said expression to a 

9 site mining template; and 

10 (5) converting said transformation information and said expression 

1 1 stored in said template to a stylesheet utilizable for mining content from said 

12 source page to produce a destination page containing said extracted content. 
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1 8. The method of claim 7, further comprising the step of receiving format 

2 information for formatting a layout of the stylesheet. 

1 9. The method of claim 7, further comprising the steps of: 

2 (1) receiving an indication of said source page; 

3 (2) retrieving said source page; and 

4 (3) displaying said one or more items of content contained in said 

5 source page for allowing a selection of said content to be extracted. 

"3L 10. The method of claim 7, wherein said transformation information includes 

UJ2 procedural tags for controlling a processing routine in said stylesheet. 

f 1 11. The method of claim 7, wherein said transformation information includes 

iy2 transformation tags for manipulating content extracted from said source page in 

K3 said stylesheet. 

**! 12. The method of claim 7, wherein said item of content is delineated by one 

2 or more tags. 

1 13. The method of claim 7, wherein step 5 comprises the step of compiling 

2 said template with a two pass compilation process, wherein a first pass generates a 

3 main body of said stylesheet and a second pass generates commands located 

4 outside of said main body. 
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1 14. The method of claim 7, wherein step 2 further comprises the step of 

2 receiving filtering criteria for indicating content to be extracted, wherein said 

3 criteria includes at least one of: selecting a single item of content located at a 

4 particular position, siblings of said item of content, similarly named siblings of 

5 said item of content, similarly named items of content located anywhere within 

6 said source page, and content containing specific text. 

1 15. The method of claim 7, wherein step 2 further comprises the steps of: 
™2 (1) receiving an indication of a root element; and 

C§3 (2) displaying content stemming from said root element, wherein said 

ff4 content to be extracted is selected from said item of content stemming from said 

KSt(S 

WS root element, and wherein said expression is determined by combining an 

f 6 expression locating said root element with an expression locating said selected 

h7 content relative to said root element. 

Pi 16. The method of claim 7, wherein said source page comprises a XML 

2 compliant document. 

1 17. The method of claim 7, wherein said source page comprises a HTML 

2 document. 

1 18. The method of claim 7, wherein said expression comprises an XPath 

2 syntax expression. 



27 



109085-123 



PATENT 



1 19. The method of claim 7, wherein said stylesheet includes a XSLT 

2 stylesheet. 

1 20. A method for generating a site mining expression for use in locating one 

2 item of content of a plurality of items of content contained in a source page, said 

3 method comprising the steps of: 

4 (1) displaying said plurality of items of content on a graphical user 

5 interface hierarchically in tree view form; 

_6 (2) receiving a selection for said one item of content, wherein said one 

Q 

JJ7 item of content is to be extracted from said source page; 

Qi8 (3) displaying any graphical components of said one item of content 

0i9 selected in step 2; and 

f 10 (4) generating a site mining expression for locating said one item of 

j||l content in said source page, wherein said site mining expression is capable of 

i -in 

f|2 locating content in a document written in an extensible markup language. 

1 21. The method of claim 20, wherein said site mining expression comprises an 

2 XPath expression. 

1 22. The method of claim 20, further comprising receiving the step of filtering 

2 criteria for indicating content to be extracted, wherein said criteria includes at 

3 least one of: selecting a single item of content located at a particular position, 

4 siblings of said item of content, similarly named siblings of said item of content, 
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5 similarly named items of content located anywhere within said source page, and 

6 content containing specific text. 

1 23. The method of claim 20, further comprising the steps of: 

2 (1) receiving a designation of an item of content as a root element; and 

3 (2) displaying items of content stemming from said root element, 

4 wherein said item of content to be extracted is selected from said item of content 

5 stemming from said root element, and wherein said expression is determined by 

6 combining an expression locating said root element with an expression locating 
07 said item of content to be extracted relative to said root element. 

fAl 24. A system for extracting and transforming content from a source page for 

W'2 transmission to a mobile device, said system comprising: 
r f 3 a central computer comprising: 

I "j4 a processor utilizable for generating a stylesheet, wherein said stylesheet 

p5 includes information indicating the content to be extracted from said source page 

6 and transformation information for manipulating the content; 

7 an interface in communication with said processor for receiving, from the 

8 mobile device, a request to display said source page; 

9 wherein, upon receiving said request, said processor applies said stylesheet 

10 to said source page to produce a destination page which includes said extracted 

1 1 content manipulated according to said transformation information; and 

12 wherein said interface transmits said destination page to said mobile 

13 device. 
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1 25. The system of claim 24, wherein said processor applies said stylesheet by 

2 retrieving said source page from a web server; and by identifying said content to 

3 be extracted using a site mining expression. 

1 26. The system of claim 24, wherein said processor is further capable of 

2 determining a site mining expression for uniquely locating said content to be 

3 extracted. 

1 27. The system of claim 24, wherein said processor generates said stylesheet 

S 2 b y : 

Li|3 receiving and storing to a site mining template said information indicating 

H4 said content to be extracted and said transformation information for manipulating 

: 5 the content; and 

6 compiling said template to produce said stylesheet. 

yi 28. The system of claim 24, wherein said source page comprises a XML 

2 compliant document. 

1 29. The system of claim 24, wherein said source page comprises a HTML 

2 document. 

1 30. A system for generating a stylesheet, said system comprising: 

2 a central computer comprising: 
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3 an interface for receiving an indication of an item of content to be 

4 extracted from a source page containing one or more items of content and for 

5 receiving transformation information for manipulating said item of content; 

6 a processor in communication with said interface, wherein said processor 

7 is capable of determining an expression for uniquely locating said item of content 

8 to be extracted; 

9 a memory for storing a site mining template, said template including said 

10 transformation information and said expression; and 

11 a compiler implementable by said processor for converting said 
JJ2 transformation information and said expression stored in said template to a 
*4& stylesheet utilizable for mining content from said source page to produce a 
Jj4 destination page containing said extracted content. 

Ml 31. The system of claim 30, wherein said interface is capable of: 
|72 receiving an indication of said source page; 

5J3 retrieving said source page; and 

4 transmitting said one or more items of content contained in said source 

5 page to a display for allowing a selection of said content to be extracted. 

1 32. The system of claim 30, wherein said transformation information includes 

2 procedural tags for controlling a processing routine in said stylesheet. 
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1 33. The system of claim 30, wherein said transformation information includes 

2 transformation tags for manipulating content extracted from said source page in 

3 said stylesheet. 

1 34. The system of claim 30, wherein said item of content is delineated by one 

2 or more tags. 

1 35. The system of claim 30, wherein said compiler converts said information 

2 using a two pass compilation process, wherein a first pass generates a main body 
y!3 of said stylesheet and a second pass generates commands located outside of said 
UM main body. 

vil 36. The system of claim 30, wherein said processor determines said 

M2 expression by receiving filtering criteria via said interface for indicating content to 

H3 be extracted, wherein said criteria includes at least one of: selecting a single item 

air 

« 

Jr 4 of content located at a particular position, siblings of said item of content, 

5 similarly named siblings of said item of content, similarly named items of content 

6 located anywhere within said source page, and content containing specific text. 

1 37. The system of claim 30, wherein said processor determines said 

2 expression by: 

3 receiving, via said interface, an indication of a root element; and 

4 transmitting content stemming from said root element to a display, 

5 wherein said content to be extracted is selected, using said display, from said item 
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6 of content stemming from said root element, and wherein said expression is 

7 determined by combining an expression locating said root element with an 

8 expression locating said selected content relative to said root element. 

1 38. The system of claim 30, wherein said source page comprises a XML 

2 compliant document. 

1 39. The system of claim 30, wherein said source page comprises a HTML 

2 document. 

jgi 40. The system of claim 30, wherein said expression comprises an XPath 

U2 syntax expression. 

i il 41 . The system of claim 30, wherein said stylesheet includes a XSLT 

H2 stylesheet. 

i : I 42. A system for generating a site mining expression for use in locating one 

C' 

2 item of content of a plurality of items of content contained in a source page, said 

3 system comprising: 

4 a central computer comprising: 

5 an interface for transmitting said plurality of items of content to a 

6 graphical user interface for hierarchically display in tree view form, said interface 

7 being capable of receiving a selection from said graphical user interface for said 

8 one item of content, wherein said one item of content is to be extracted from said 

9 source page, wherein upon receiving said selection said interface transmits any 
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10 graphical components of said one item of content for display on said graphical 

1 1 user interface; and 

12 a processor in communication with said interface and capable of 

13 generating a site mining expression for locating said one item of content in said 

14 source page, wherein said site mining expression is capable of locating content in 

15 a document written in an extensible markup language. 

1 43. The system of claim 42, wherein said site mining expression comprises an 

2 XPath expression. 

^jl 44. A system for extracting and transforming content from a source page for 

y;2 transmission to a mobile device, said system comprising: 
CF-3 a server comprising a processor and a memory, wherein said processor is 

:" A capable of: 

generating a stylesheet, wherein said stylesheet includes information 

06 indicating the content to be extracted from said source page and transformation 

7 information for manipulating the content; 

8 receiving, from the mobile device, a request to display said source page; 

9 applying said stylesheet to said source page to produce a destination page, 

10 wherein said destination page includes said extracted content manipulated 

11 according to said transformation information; and 

12 transmitting said destination page to said mobile device. 
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1 45. The system of claim 44, wherein said processor is further capable of 

2 determining a site mining expression for uniquely locating said content to be 

3 extracted. 

1 46. The system of claim 44, wherein said stylesheet is generated by: 

2 receiving and storing to a site mining template said information indicating 

3 said content to be extracted and said transformation information for manipulating 

4 the content; and 

, J5 compiling said template to produce said stylesheet. 

Wl 47. A system for generating a stylesheet, said system comprising: 

^2 a server comprising a processor and a memory, wherein said processor is 

3 capable of: 

jr. A receiving format information for formatting a layout of the stylesheet; 

!fj5 receiving an indication of an item of content to be extracted from a source 

C 6 page containing one or more items of content; 

7 determining an expression for uniquely locating said item of content to be 

8 extracted; 

9 receiving transformation information for manipulating said item of 

10 content; 

1 1 storing said format information, said transformation information, and said 

12 expression to a site mining template; and 
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13 converting said transformation information and said expression stored in 

14 said template to a stylesheet utilizable for mining content from said source page to 

15 produce a destination page containing said extracted content. 

1 48. The system of claim 47, wherein said transformation information includes 

2 transformation tags for manipulating content extracted from said source page in 

3 said stylesheet. 

1 49. The system of claim 47, wherein said expression is determined by 

Jl receiving filtering criteria for indicating content to be extracted, wherein said 
criteria includes at least one of: selecting a single item of content located at a 

1^4 particular position, siblings of said item of content, similarly named siblings of 

015 said item of content, similarly named items of content located anywhere within 

f 6 said source page, and content containing specific text. 

iff 1 50. The system of claim 47, wherein said expression is determined by: 
02 receiving an indication of a root element; and 

3 displaying content stemming from said root element, wherein said content 

4 to be extracted is selected from said item of content stemming from said root 

5 element, and wherein said expression is determined by combining an expression 

6 locating said root element with an expression locating said selected content 

7 relative to said root element. 
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1 5 1 . A system for generating a site mining expression for use in locating one 

2 item of content of a plurality of items of content contained in a source page, said 

3 system comprising: 

4 a server comprising a processor and a memory, wherein said processor is 

5 capable of: 

6 displaying said plurality of items of content on a graphical user interface 

7 hierarchically in tree view form; 

8 receiving a selection for said one item of content, wherein said one item of 
09 content is to be extracted from said source page; 

?B) displaying any graphical components of said one item of content; and 

j 1 generating a site mining expression for locating said one item of content in 

: 4|2 said source page, wherein said site mining expression is capable of locating 

l#3 content in a document written in an extensible markup language, 

p-1 52, The system of claim 51, wherein said processor is further capable of 

2 receiving filtering criteria for indicating content to be extracted, wherein said 

3 criteria includes at least one of: selecting a single item of content located at a 

4 particular position, siblings of said item of content, similarly named siblings of 

5 said item of content, similarly named items of content located anywhere within 

6 said source page, and content containing specific text. 

1 53. The system of claim 51, wherein said processor is further capable of: 

2 receiving a designation of an item of content as a root element; and 
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3 displaying items of content stemming from said root element, wherein said 

4 item of content to be extracted is selected from said item of content stemming 

5 from said root element, and wherein said expression is determined by combining 

6 an expression locating said root element with an expression locating said item of 

7 content to be extracted relative to said root element. 

1 54. A computer program implemented on a computer-readable medium for 

2 extracting and transforming content from a source page for transmission to a 

3 mobile device, said program comprising: 

CM computer-readable instructions for generating a stylesheet, wherein said 

stylesheet includes information indicating the content to be extracted from said 

rift 

7j6 source page and transformation information for manipulating the content; 

%J1 computer-readable instructions for receiving, from the mobile device, a 

M8 request to display said source page; 

H9 computer-readable instructions for applying said stylesheet to said source 

" 

*2jp page to produce a destination page, wherein said destination page includes said 

1 1 extracted content manipulated according to said transformation information; and 

12 computer-readable instructions for transmitting said destination page to 

13 said mobile device. 

1 55. The computer program of claim 54, wherein said instructions for applying 

2 comprises: 

3 computer-readable instructions for retrieving said source page from a web 

4 server; and 
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5 computer-readable instructions for identifying said content to be extracted 

6 using a site mining expression. 

1 56. The computer program of claim 54, further comprising computer-readable 

2 instructions for determining a site mining expression for uniquely locating said 

3 content to be extracted. 

1 57. The computer program of claim 54, wherein said instructions for 

2 generating further comprises: 

3 computer-readable instruction for receiving and storing to a site mining 
O4 template said information indicating said content to be extracted and said 

3 transformation information for manipulating the content; and 

L:6 computer-readable instructions for compiling said template to produce said 

\|7 stylesheet. 

l^l 58. The computer program of claim 54, wherein said source page comprises a 

p2 XML compliant document. 

1 59, The computer program of claim 54, wherein said source page comprises a 

2 HTML document. 

1 60. A computer program implemented on a computer-readable medium for 

2 generating a stylesheet, said program comprising: 
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3 computer-readable instructions for receiving an indication of an item of 

4 content to be extracted from a source page containing one or more items of 

5 content; 

6 computer-readable instructions for receiving determining an expression for 

7 uniquely locating said item of content to be extracted; 

8 computer-readable instructions for receiving transformation information 

9 for manipulating said item of content; 

10 computer-readable instructions for storing said transformation information 

1 1 and said expression to a site mining template; and 

412 computer-readable instructions for converting transformation information 

43 and expression stored in said template to a stylesheet utilizable for mining content 

J|4 from said source page to produce a destination page containing said extracted 

s 15 content. 

fTl 61. The computer program of claim 60, wherein said program further 

S'2 comprises: 

3 computer-readable instructions for receiving an indication of said source 

4 page; 

5 computer-readable instructions for retrieving said source page; and 

6 computer-readable instructions for displaying said one or more items of 

7 content contained in said source page for allowing a selection of said content to be 

8 extracted. 
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1 62. The computer program of claim 60, wherein said transformation 

2 information includes procedural tags for controlling a processing routine in said 

3 stylesheet. 

1 63. The computer program of claim 60, wherein said transformation 

2 information includes transformation tags for manipulating content extracted from 

3 said source page in said stylesheet. 

1 64. The computer program of claim 60, wherein said item of content is 

yp2 delineated by one or more tags. 

'f s 'l 65. The computer program of claim 60, wherein said instructions for 

% j2 converting further comprises computer-readable instructions for compiling said 

M3 template with a two pass compilation process, wherein a first pass generates a 

ff4 main body of said stylesheet and a second pass generates commands located 

"Zl5 outside of said main body. 

1 66. The computer program of claim 60, wherein said instructions for 

2 determining an expression further comprises computer-readable instructions for 

3 receiving filtering criteria for indicating content to be extracted, wherein said 

4 criteria includes at least one of: selecting a single item of content located at a 

5 particular position, siblings of said item of content, similarly named siblings of 

6 said item of content, similarly named items of content located anywhere within 

7 said source page, and content containing specific text. 
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1 67. The computer program of claim 60, wherein said instructions for 

2 determining an expression further comprises a 

3 computer-readable instructions for receiving an indication of a root 

4 element; and 

5 computer-readable instructions for displaying content stemming from said 

6 root element, wherein said content to be extracted is selected from said item of 

7 content stemming from said root element, and wherein said expression is 

8 determined by combining an expression locating said root element with an 
Q) expression locating said selected content relative to said root element. 

|ji 68. The computer program of claim 60, wherein said source page comprises a 

BU XML compliant document. 

h ft 69. The computer program of claim 60, wherein said source page comprises a 

fj ; 2 HTML document. 

1 70. The computer program of claim 60, wherein said expression comprises an 

2 XPath syntax expression. 

1 71. The computer program of claim 60, wherein said stylesheet includes a 

2 XSLT stylesheet. 
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1 72. A computer program implemented on a computer-readable medium for 

2 generating a site mining expression for use in locating one item of content of a 

3 plurality of items of content contained in a source page, said program comprising: 

4 computer-readable instructions for displaying said plurality of items of 

5 content on a graphical user interface hierarchically in tree view form; 

6 computer-readable instructions for receiving a selection for said one item 

7 of content, wherein said one item of content is to be extracted from said source 

8 page; 

O? computer-readable instructions for displaying any graphical components of 

M) said one item of content; and 

jid computer-readable instructions for generating a site mining expression for 

%> locating said one item of content in said source page, wherein said site mining 

if 3 expression is capable of locating content in a document written in an extensible 

fj4 markup language. 

1 73. The computer program of claim 72, wherein said site mining expression 

2 comprises an XPath expression. 

1 74. The computer program of claim 72, further comprising computer-readable 

2 instructions for receiving filtering criteria for indicating content to be extracted, 

3 wherein said criteria includes at least one of: selecting a single item of content 

4 located at a particular position, siblings of said item of content, similarly named 
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siblings of said item of content, similarly named items of content located 
anywhere within said source page, and content containing specific text. 



1 75 . The computer program of claim 72, further comprising: 

2 computer-readable instructions for receiving a designation of an item of 

3 content as a root element; and 

4 computer-readable instructions for displaying items of content stemming 

5 from said root element, wherein said item of content to be extracted is selected 

6 from said item of content stemming from said root element, and wherein said 
□7 expression is determined by combining an expression locating said root element 
/$ with an expression locating said item of content to be extracted relative to said 
T]9 root element. 

s 1 76. A system for extracting and transforming content from a source page for 

j y 2 transmission to a mobile device, said system comprising: 

jp-3 means for generating a stylesheet, wherein said stylesheet includes 

4 information indicating the content to be extracted from said source page and 

5 transformation information for manipulating the content; 

6 means for receiving, from the mobile device, a request to display said 

7 source page; 

8 means for applying said stylesheet to said source page to produce a 

9 destination page, wherein said destination page includes said extracted content 

10 manipulated according to said transformation information; and 

1 1 means for transmitting said destination page to said mobile device. 

44 



109085-123 



PATENT 



1 77. The system of claim 76, wherein said means for applying comprises: 

2 means for retrieving said source page from a web server; and 

3 means for identifying said content to be extracted using a site mining 

4 expression. 

1 78. The system of claim 76, further comprising means for determining a site 

2 mining expression for uniquely locating said content to be extracted. 

1 79. The system of claim 76, wherein said means for generating comprises: 
3> means for receiving and storing to a site mining template said information 

US indicating said content to be extracted and said transformation information for 

manipulating the content; and 
""15 means for compiling said template to produce said stylesheet. 

Hi 80. The system of claim 76, wherein said source page comprises a XML 

1^2 compliant document. 

1 81. The system of claim 76, wherein said source page comprises a HTML 

2 document. 

1 82. A system for generating a stylesheet, said system comprising: 

2 means for receiving an indication of an item of content to be extracted 

3 from a source page containing one or more items of content; 
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4 means for determining an expression for uniquely locating said item of 

5 content to be extracted; 

6 means for receiving transformation information for manipulating said item 

7 of content; 

8 means for storing said transformation information, and said expression to a 

9 site mining template; and 

10 means for converting said transformation information and expression 

1 1 stored in said template to a stylesheet utilizable for mining content from said 

12 source page to produce a destination page containing said extracted content. 

^{l 83 . The system of claim 82, further comprising means for receiving format 

M2 information for formatting a layout of said stylesheet, and means for storing said 

SB formation information to said template. 

[Jl 84. The system of claim 82, further comprising: 

Q2 means for receiving an indication of said source page; 

3 means for retrieving said source page; and 

4 means for displaying said one or more items of content contained in said 

5 source page for allowing a selection of said content to be extracted. 

1 85. The system of claim 82, wherein said transformation information includes 

2 procedural tags for controlling a processing routine in said stylesheet. 
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1 86. The system of claim 82, wherein said transformation information includes 

2 transformation tags for manipulating content extracted from said source page in 

3 said stylesheet. 

1 87. The system of claim 82, wherein said item of content is delineated by one 

2 or more tags. 

1 88. The system of claim 82, wherein said means for converting comprises 

2 means for compiling said template with a two pass compilation process, wherein a 
5f3 first pass generates a main body of said stylesheet and a second pass generates 
UH commands located outside of said main body. 

Cjl 89. The system of claim 82, wherein said means for determining comprises 

N2 means for further comprises receiving filtering criteria for indicating content to be 

extracted, wherein said criteria includes at least one of: selecting a single item of 

J?4 content located at a particular position, siblings of said item of content, similarly 

5 named siblings of said item of content, similarly named items of content located 

6 anywhere within said source page, and content containing specific text, 

1 90. The system of claim 82, wherein said means for determining further 

2 comprises: 

3 means for receiving an indication of a root element; and 

4 means for displaying content stemming from said root element, wherein 

5 said content to be extracted is selected from said item of content stemming from 
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6 said root element, and wherein said expression is determined by combining an 

7 expression locating said root element with an expression locating said selected 

8 content relative to said root element. 

1 91. The system of claim 82, wherein said source page comprises a XML 

2 compliant document. 

1 92. The system of claim 82, wherein said source page comprises a HTML 

2 document. 

JJl 93. The system of claim 82, wherein said expression comprises an XPath 

UJ2 syntax expression. 

Z d 94. The system of claim 82, wherein said stylesheet includes a XSLT 

U2 stylesheet. 

Jtjl 95. A system for generating a site mining expression for use in locating one 

'^2 item of content of a plurality of items of content contained in a source page, said 

3 system comprising: 

4 means for displaying said plurality of items of content on a graphical user 

5 interface hierarchically in tree view form; 

6 means for receiving a selection for said one item of content, wherein said 

7 one item of content is to be extracted from said source page; 

8 means for displaying any graphical components of said one item of 

9 content; and 
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10 means for generating a site mining expression for locating said one item of 

1 1 content in said source page, wherein said site mining expression is capable of 

12 locating content in a document written in an extensible markup language. 

1 96. The system of claim 95, wherein said site mining expression comprises an 

2 XPath expression. 

1 97. The system of claim 95, further comprising means for receiving filtering 

2 criteria for indicating content to be extracted, wherein said criteria includes at 
^3 least one of: selecting a single item of content located at a particular position, 

siblings of said item of content, similarly named siblings of said item of content, 

Gi5 similarly named items of content located anywhere within said source page, and 

Cfi6 content containing specific text. 

"nil 98. The system of claim 95, further comprising: 

•j|2 means for receiving a designation of an item of content as a root element; 

£--■3 and 

4 means for displaying items of content stemming from said root element, 

5 wherein said item of content to be extracted is selected from said item of content 

6 stemming from said root element, and wherein said expression is determined by 

7 combining an expression locating said root element with an expression locating 

8 said item of content to be extracted relative to said root element. 
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